A Weighted Hybrid Thresholding Approach for Text Binarization

نویسندگان

  • S. T. Deepa
  • S. P. Victor
  • N. J. Leite
  • Christian Wolf
  • David Doermann
  • Celine Mancas
  • Bernard Gosselin
  • C. V. Jawahar
چکیده

Text extraction in real images taken in unconstrained environments remains surprisingly challenging in Computer Vision due to language characteristics, complex backgrounds and the text color. Extraction of text and caption from images and videos is important and in great demand for video retrieval, annotation, indexing and content analysis. In this paper we propose a weighted hybrid thresholding approach. It is demonstrated that the proposed method achieved reasonable accuracy of the text extraction for moderately difficult examples.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hybrid Binariztion Technique for Historical Manuscripts

This paper presents a new hybrid approach for the binarization and enhancement of Historical Manuscript. This paper deals with degradations which occur due to shadows, non-uniform illumination, low contrast and strain. We follow two distinct method of Binarization with a pre-processing procedure using a adaptive Wiener filter, a rough estimation of foreground regions and a background surface ca...

متن کامل

Text/ Background separation in the degraded document images by combining several thresholding techniques

Extract the text from the background is an important step in all process of document analysis and recognition. If this extraction is easy for document images of good quality by applying simple techniques of global thresholding, the images of degraded documents require a more accurate analysis and we have recourse in this case to local methods. Indeed, these latter are generally more efficient a...

متن کامل

A Hybrid Binarization Technique for Document Images

In this chapter, a binarization technique specifically designed for historical document images is presented. Existing binarization techniques focus either on finding an appropriate global threshold or adapting a local threshold for each area in order to remove smear, strains, uneven illumination etc. Here, a hybrid approach is presented that first applies a global thresholding technique and, th...

متن کامل

A Review on Global Binarization Algorithms for Degraded Document Images

Several algorithms have previously been proposed for improving the thresholding of degraded document images. No algorithm can solve all types of problems, but some algorithms are better than others for specific situations. This article reviews global binarization algorithms for improving degraded document images, thus indicating their differences and similarities, and also their advantages and ...

متن کامل

Combining multiple thresholding binarization values to improve OCR output

For noisy, historical documents, a high optical character recognition (OCR) word error rate (WER) can render the OCR text unusable. Since image binarization is often the method used to identify foreground pixels, a significant body of research has sought to improve image-wide binarization directly. Instead of relying on any one imperfect binarization technique, our method incorporates informati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012